Words containing ,:

Rank in Wordlist | Frequency | Word |
---|---|---|
7375 | 221 | 1,5 |
8628 | 182 | 2,5 |
13382 | 102 | 3,5 |
15802 | 81 | 0,5 |
16126 | 79 | 1,2 |
16914 | 74 | 4,5 |
17902 | 68 | 6,5 |
17903 | 68 | 8,5 |
19743 | 59 | 1,8 |
22100 | 50 | 1,3 |

Words containing (:

Rank in Wordlist | Frequency | Word |
---|---|---|
9391 | 163 | étudiant(e)s |
11310 | 128 | un(e |
13793 | 98 | candidat(e |
15703 | 82 | candidat(e)s |
17762 | 69 | ami(e)s |
20142 | 58 | le(s |
25033 | 42 | employé(e)s |
26409 | 39 | participant(e)s |
27154 | 37 | bisexuel(le)s |
28220 | 35 | d'un(e |

Words containing ):

Rank in Wordlist | Frequency | Word |
---|---|---|
9391 | 163 | étudiant(e)s |
15703 | 82 | candidat(e)s |
17762 | 69 | ami(e)s |
25033 | 42 | employé(e)s |
26409 | 39 | participant(e)s |
27154 | 37 | bisexuel(le)s |
34338 | 26 | handicapé(e)s |
36120 | 24 | enseignant(e)s |
40866 | 20 | professeur(e)s |
43894 | 18 | �tudiant(e)s |

Words containing %:

Rank in Wordlist | Frequency | Word |
---|---|---|
98929 | 4 | 5%/alc |
115054 | 3 | 5.2%/alc |
140049 | 2 | 18-23%)13,29 |
140313 | 2 | 2%)3 |
141018 | 2 | 4,8%/alc |
141040 | 2 | 4.8%/alc |
141291 | 2 | 5-7%1,9,17,20 |
193816 | 1 | 10,2%o |
193912 | 1 | 100%est |
193913 | 1 | 100%vision |

Words containing &:

Rank in Wordlist | Frequency | Word |
---|---|---|
13503 | 101 | R&D |
34696 | 25 | B&B |
41959 | 19 | km² |
45934 | 16 | Rx&D |
47257 | 15 | AT&T |
60849 | 10 | R&B |
60858 | 10 | RS&DE |
69581 | 8 | SF&F |
73385 | 7 | Bq/m³ |
74560 | 7 | M&H |

Words containing $:

Rank in Wordlist | Frequency | Word |
---|---|---|
74559 | 7 | M$US |
79537 | 6 | 7$/adulte |
87783 | 5 | 3$/enfant |
98889 | 4 | 40$/pers |
115058 | 3 | 50$/pers |
119709 | 3 | INVE$TFOLIO |
139465 | 2 | 1,00$/minute |
139586 | 2 | 100$CAN |
139907 | 2 | 15.00$/pers |
139915 | 2 | 1500$CAD/motoneige |

Words containing ":

Rank in Wordlist | Frequency | Word |
---|---|---|
98745 | 4 | 20"×25 |
98880 | 4 | 4"1/2 |
115040 | 3 | 5"×7 |
135813 | 3 | qu"on |
139270 | 2 | -Historique"a |
141842 | 2 | 8½"×14 |
141885 | 2 | 81/2"×14 |
157024 | 2 | Musique"a |
158434 | 2 | PROF"et |
159597 | 2 | Programmation"-- |

Words containing ':

Rank in Wordlist | Frequency | Word |
---|---|---|
64 | 22556 | d'un |
69 | 20289 | d'une |
164 | 9194 | qu'il |
171 | 8792 | C'est |
198 | 7797 | c'est |
212 | 7453 | n'est |
247 | 6333 | d'autres |
455 | 3730 | l'information |
464 | 3659 | s'est |
478 | 3562 | qu'ils |

Words containing +:

Rank in Wordlist | Frequency | Word |
---|---|---|
69846 | 8 | VIVA+Clinic |
87614 | 5 | 1+800+SANDMAN |
87732 | 5 | 2+3 |
91237 | 5 | Power+Plus |
102591 | 4 | LibQUAL+CM |
107202 | 4 | ctrl+c |
111184 | 4 | n+1 |
111775 | 4 | posi+if |
117646 | 3 | Ctrl+C |
117805 | 3 | DVD+CD |

Words containing *:

Rank in Wordlist | Frequency | Word |
---|---|---|
41207 | 19 | Con*Cept |
99395 | 4 | Ani*m�les |
116659 | 3 | CA*net |
116660 | 3 | CA*net2 |
116661 | 3 | CA*net3 |
116662 | 3 | CA*netII |
124150 | 3 | SQL*Plus |
139613 | 2 | 1024*768 |
139824 | 2 | 1366*768 |
193012 | 1 | 0.5*10-3 |

Words containing /:

Rank in Wordlist | Frequency | Word |
---|---|---|
1473 | 1351 | et/ou |
7104 | 233 | 1/2 |
8123 | 197 | VIH/sida |
9610 | 158 | http://www |
11436 | 126 | km/h |
17698 | 69 | 1/4 |
21539 | 52 | 2/3 |
23358 | 46 | 3/4 |
28132 | 35 | VIH/SIDA |
28505 | 34 | 1/3 |
In the last subsection of this type, we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words; others will not. If words containing forbidden characters do not have a very low frequency, there may be a problem in preprocessing.
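
The check described above can be sketched in a few lines of Python. This is only an illustration, not the procedure used to produce this report: the function name `flag_suspicious`, the choice of the apostrophe as the only allowed character, and the threshold `min_freq = 10` are all assumptions made for the example. The sample rows are copied from the tables in this section.

```python
from typing import Iterable

# Characters examined in this subsection.
SPECIAL_CHARS = ",()%&$\"'+*=/_"


def flag_suspicious(
    wordlist: Iterable[tuple[str, int]],
    allowed: str = "'",   # assumption: only the apostrophe is legal inside words
    min_freq: int = 10,   # assumption: below this, a frequency counts as "very low"
) -> list[tuple[str, int, str]]:
    """Return (word, freq, offending_chars) for every word that contains a
    special character not in `allowed` and occurs at least `min_freq` times."""
    forbidden = set(SPECIAL_CHARS) - set(allowed)
    hits = []
    for word, freq in wordlist:
        offending = "".join(sorted(forbidden.intersection(word)))
        if offending and freq >= min_freq:
            hits.append((word, freq, offending))
    return hits


# A few (word, frequency) rows copied from the tables in this section.
sample = [
    ("d'un", 22556),
    ("étudiant(e)s", 163),
    ("R&D", 101),
    ('qu"on', 3),
    ("n+1", 4),
]

for word, freq, chars in flag_suspicious(sample):
    print(f"{freq:>6}  {word}  (contains {chars})")
```

Both `allowed` and `min_freq` are language-dependent and would need to be chosen per corpus; the defaults here are placeholders.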
Words containing %:
select w_id-100, freq, word from words where w_id>100 and word like "%\%%" limit 10;
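
The same query can be run for each of the listed characters. The sketch below does this with Python's built-in `sqlite3` module against a small in-memory table, as a stand-in for the real database. The column types, the helper names `like_pattern` and `words_containing`, and the added `ORDER BY` are assumptions for the example; only the table name `words` and the columns `w_id`, `freq`, and `word` come from the query above. Note that `_`, like `%`, is a wildcard in `LIKE` and must be escaped, and that SQLite needs an explicit `ESCAPE` clause where MySQL uses the backslash by default.

```python
import sqlite3

# Characters examined in this subsection.
SPECIAL_CHARS = ",()%&$\"'+*=/_"


def like_pattern(char: str) -> str:
    """Build a LIKE pattern that matches words containing `char` literally."""
    # % and _ are LIKE wildcards; the escape character itself must be escaped too.
    if char in "%_\\":
        char = "\\" + char
    return f"%{char}%"


def words_containing(conn: sqlite3.Connection, char: str, limit: int = 10):
    """Return (rank, freq, word) rows, mirroring the shape of the query above."""
    # ORDER BY is an addition in this sketch so that the result is deterministic.
    sql = (
        "SELECT w_id - 100, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"
    )
    return conn.execute(sql, (like_pattern(char), limit)).fetchall()


# Toy table (assumed schema) filled with rows from this section; w_id = rank + 100.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER)")
conn.executemany(
    "INSERT INTO words VALUES (?, ?, ?)",
    [
        (1573, "et/ou", 1351),
        (11536, "km/h", 126),
        (13603, "R&D", 101),
        (99029, "5%/alc", 4),
    ],
)

for char in SPECIAL_CHARS:
    rows = words_containing(conn, char)
    if rows:
        print(f"Words containing {char}:")
        for rank, freq, word in rows:
            print(f"  {rank} | {freq} | {word}")
```

Without the escaping step, the pattern for `_` would match every word of at least one character, so the result for that character would be meaningless.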
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots